On the self-consistency of the excursion set approach 
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The excursion set approach provides a framework for predicting how the abundance of dark matter 
halos depends on the initial conditions. A key ingredient of this formalism comes from the physics of 
halo formation: the specification of a critical overdensity threshold (barrier) which protohalos must 
exceed if they are to form bound virialized halos at a later time. Another ingredient is statistical, as it 
requires the specification of the appropriate statistical ensemble over which to average when making 
predictions. The excursion set approach explicitly averages over all initial positions, thus implicitly 
assuming that the appropriate ensemble is that associated with randomly chosen positions in space, 
rather than special positions such as peaks of the initial density field. Since halos are known to 
collapse around special positions, it is not clear that the physical and statistical assumptions which 
underlie the excursion set approach are self-consistent. We argue that they are at least for low mass 
halos, and illustrate by comparing our excursion set predictions with numerical data from the DEUS 
simulations. 

PACS numbers: 



The excursion set approach [IHl] is the most widely 
developed formalism for estimating how halo abundances 
depend on the background cosmology. Rather than di- 
rectly predicting the comoving density of halos dn/dM 
in the mass range [M, M + dAl] , it provides an estimate 
of the mass fraction in halos, f(M) = (M/p) dn/dM, 
where p is the comoving background density. That is to 
say, the approach seeks to answer the question: What 
is the probability that a randomly chosen particle in a 
simulation is in a halo of mass Ml 

Building upon the seminal work of Press & Schecthcr 
[3], the excursion set assumes that at any random point 
in space the initial overdensity 8 performs a random walk 
as function of the smoothing scale R. Since initial fluctu- 
ations are small, this scale can be associated to a mass, 
M = pV(R), where V(R) is the filtered volume. Then, 
the multiplicity function /(M) is inferred from ensem- 
ble averaging over the number of trajectories that first- 
cross a non- linear collapse threshold (absorbing barrier) . 
This condition solves the so called "cloud-in-cloud" prob- 
lem due to miscounting the number of regions which are 
above the threshold at multiple scales. 

The ensemble properties of the walks are determined 
by the statistics of the initial density fluctuation field and 
the filter function, which specify how the variance of the 
walk height depends on R and hence on mass: (i5 2 ) = 



(7 = S where S 
sharp-k filter, W(k, R) = 
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J dkk 2 P(k)W 2 (k,R). For a 
'(1/R-k) and V{R) = 6tt 2 R 3 , 



it can be shown that random walks are uncorrelated, so 
the computation of the multiplicity function reduces to 
solving a simple diffusion problem [l|. In contrast, the 
top-hat filter in real space (sharp-x), W(x, R) = 9(x — R) 
and V(R) = (4?r/3) R 3 , observationally used to probe the 
linear density field, leads to correlated steps for which the 
computation of f(M) is more challenging. In this case 
the effects of correlated steps can be accounted-for per- 
turbatively using a path-integral approach 0-0] or other 
approximations (8l4l0l|. 

Another key ingredient concerns the physics of col- 
lapse. This defines a critical density which must be ex- 
ceeded if an object is to pull itself together against the 
expanding background. The simplest excursion set for- 
mulae assume a constant threshold as predicted by the 
spherical collapse model [Hj]. However, initial fluctua- 
tions are non-spherical and their evolution depends on 
the surrounding shear field [12| , so the ellipsoidal col- 
lapse model [13[ is assumed to be more realistic. This 
naturally gives rise to an approach with moving 3, ll| 
and/or stochastic 14 - critical collapse barriers. The 
excursion set approach implicitly assumes that the statis- 
tics of patches which are destined to form halos are no 
different from those of randomly placed patches. How- 
ever, halos have been shown to colla pse around special 
positions in the initial density field [14j corresponding 
mostly to initial overdense ellipsoidal patch of matter 
24(| . Hence, even assuming an ellipsoidal collapse bar- 
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rier, the model parameters for which the excursion set 
mass function reproduces that from N-body simulations 
may not be the same as those expected from the ellip- 
soidal collapse model. Most importantly, it is not a pri- 
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on guaranteed that the barrier model assumed in the 
excursion set calculation is also capable of reproducing 
the distribution of initial overdensities associated with 
N-body halos. In 1(| 21 , 22 1 the authors has proposed 
to implement a peak criteria into the excursion set theory 
prediction to get a physical insight 1 . Alternatively one 
can still work within the excursion set and calibrate the 
collapse par ameter s such as to reproduce N-body sim- 
ulation. 



encapsulates the main feature of the ellipsoidal colla pse . 



In 17 



- 20(| the authors has shown that a diffu- 
sive drifting barrier (DDB), reproduces the mass function 
from N-body simulations to within ~ 5% accuracy. 

In this Letter, we show that if one considers the statis- 
tics of all (rather than special) walks having to cross an 
effective boundary - one which may not be the same as 
that associated with the physics of collapse - then one 
can still build a self-consistent excursion set theory which 
reproduces with high accuracy N-body prediction of the 
halo mass function. We do so using halos identified with 
the Spherical Overdensity (SO) algorithm, with overden- 
sity parameter A = 200 in N-body simulations from the 
DEUS consortium [H] described in (HE^. The simula " 
tion box was 162/i _1 Mpc and 648/i _1 Mpc on a side with 
512 3 particles, realized using the RAMSES code [H for 
a ACDM model calibrated to WMAP-5yr data. 

The excursion set approach assumes that f(M)dM 
equals the fraction of walks which first exceed a critical 
value on a smoothing scale R corresponding to a mass 
M (= pV(R)). Since cr(R) is a deterministic function of 
M . if one thinks of the walk height 8 as being the sum 
of many steps, then the problem is to find the smallest a 
at which 
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(1) 



where 8 t h is a density threshold of collapse (i.e. barrier 
height). This may be a constant or vary with a provided 
it has the same value at any point in space. Averaging 
over all possible walks is like averaging over all positions 
in space; this yields an estimate of the "first-crossing dis- 
tribution", f(a)da. Since a and M are related through 
R, one gets the excursion set estimate of the mass frac- 
tion from setting f(M)dM = f(a)da. 

From the study of the ellipsoidal collapse it follows that 
the density threshold 8 t h not only can vary with a, but 
it may differ from point to point 12 , Q, 2^- 27| , thus 
exhibiting a stochastic behaviour. This motivates study 
of estimating the smallest a for which 



6 > S th = B(a) + B, 



(2) 



with B independent of 8. We will assume that B, like 
8, is a Gaussian variate, but with zero mean and vari- 
ance Dbct 2 , while B(a) is a deterministic function which 



Note this model is very different from the one in [22 1 
since the stochasticity depends explicitly of the scale we 
consider as expected from original ellipsoidal prediction 
lil 31 1 after marginalising over ellipticity and prolaticity 
of the ellipsoidal patch of matter in a Gaussian random 
density field. However the Gaussian distribution of the 
barrier does not correspond to the ellipsoidal prediction 
but as we mention before, since the excursion set the- 
ory assumes that proto halos form from randomly placed 
patches one might appeal to the central limit theorem 
and consider such model of the barrier or invoke that a 
Gaussian model is the simplest approximation one might 
consider to underline the stochastic behaviour of the bar- 
rier. Note also this double- Gaussian model turns out to 
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have a number of other interesting properties: see 
for details. 

Written this way, one may think of the right-hand-side 
of Eq. ([2]) as being a stochastic barrier which 8 must cross. 
While suggestive, this turns out to not be the best way 
to think of the problem, a point we will return to shortly. 
For example, the associated first crossing distribution is 
trivially solved by noting that S—B is itself Gaussian with 
mean zero and variance cr 2 (l + £>#), so this problem is, in 
fact, exactly the same as that of Eq. (JT|) . In particular, the 
first crossing distribution will have the same functional 
forim except for a rescaling of a 2 — > <j 2 {1+Db) as noticed 
by |6j. Note that, if both walks are smoothed with the 
same filter, then this is true whether or not the steps in 
each walk are correlated. 

There is a sense in which this stochastic barrier model 
differs fundamentally from a deterministic one. Namely, 
the condition 5 — B = B(a) may be satisfied at many 
different values of 8 [15}. We will use 8\ x to denote the 
value of 6 at first crossing, and will argue shortly that 
a combined comparison of f(u) and H(Si x ,a) with their 
estimates from N-body simulations allows for a nice test 
of the excursion set approach. 

In the special case where B(a) — 8 C + f3a 2 (with 8 C the 
spherical collapse threshold), and the steps are uncorre- 
cted, the first crossing distribution is [33J 



8 C 2a _ 
/o(o-) = — \ — e 
a V 7r 



(3) 



1 for realistic modelling of ellipsoidal peaks, this approach required 
a calibration of the collapse parameters using N-body simulation 



where a = 1/(1 + Db)- The corresponding distribution 
for tophat smoothing of both 8 and B is discussed in 
Section 2.4 of 0. 

Here instead, we will use tophat smoothing for 8 
but assume a Markovian random walk for B; i.e., we 
assume that the collapse conditions at different scales 
are uncorrelated, and, to match scales, we assume that 
(8 2 ) = (B 2 ). This requires that the scale of the sharp-k 
filter (which we use for B) must be about a factor of 2 
smaller than that for the sharp-x filter (which we use for 
8). Then we solve for the first crossing distribution by 
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walks in the variable B — 8 with average 8 C + f3S (as de- 
fined above) and variance (1 + D B )min(S, S') + A(5, S'), 
where A(S, S') is a function that accounts for the sharp-x 
filter induced correlations in S in order to have a coherent 
mass definition. Following Q this is well approximated 
as A(5,5') = kS/S'(S' - S) where k < 1 is set by the 
linear matter power spectrum, and the first crossing dis- 
tribution is 

f(v) = fo(i) + + /"(O + /ywW (4) 

with 
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where k = a k [181] 2 . 

For KCDM cosmology, <5 C = 1.673 and k = 0.465, so 
Eq. (Q} has two free parameters: /3 and Z>b- We deter- 
mine these by fitting Eq. (Q} to the DEUS halo counts, 
finding (5 = 0.074 and D B = 0.43. Fig. Q] shows that, 
for this pair of values, the discrepancy between Eq. Q 
and the numerical data at z — is ~ 5%, which is con- 
sistent with and with numerical uncertainties on 
the mass function [30| • Fig. Q] also shows that Eq. (jU 
provides a similarly good description of the associated 
first crossing distribution, confirming fljj . and illustrat- 
ing that the excursion set approach can provide a good 
description of halo abundances. 

We now turn to the question: Is this a self-consistent 
description? To answer this, we exploit the qualitatively 
new feature of stochastic barrier models: that walks 
which first-cross on scale a will do so at a range of val- 
ues of Si x . In the usual approach, H(8i x ,<j) is a delta 
function centered on S c . 

To see what it is here, we begin by noting that in 
the coordinate system (31,32) where 32 = B / 'y ' D B and 
31 = 8, the crossing condition 8 — B(S) + B defines a 
line. It is in this sense that it is better to not think of the 
"barrier" to be crossed as being "stochastic" , but as a 2- 
dimensional random walk crossing a deterministic barrier 
(the line). Moreover the fact that this barrier is a line 
means that it is much more convenient to think of the 



Eq. J7) includes a term 0(/3 2 ) corrected in [20 



FIG. 1: (Upper panel) /(cr) of the DDB model as given by 
Eq. Q (black solid line) and the corresponding numerical 
solution from Monte-Carlo simulations (black squares) for 
/3 = 0.074 and Db = 0.43. The red data points show the mul- 
tiplicity function of SO(200) halos from N-body simulations. 
(Lower panel) Relative difference with respect to Eq. Q . The 
thin black solid lines indicates 5% deviations. 



2-D walk as taking steps which are parallel and perpen- 
dicular to the barrier. Therefore, one should rotate the 
coordinate system to: 



.9+ = (5i 



1 )/^TW l 
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(8) 



Notice that (3+) = (g_) = 0, (g%) = (g 2 _) = a 2 , and 
(3+3_) = 0. At 8 lx = B(S) + B = B(S) + g 2 VD^, we 
have 



g + =B(S)/y/{l + D B ) 

g- = B{S) ^D B /{l + D B ) + 32 y 7 ! + D B 



(9) 



making 32 = 3_/yl + D B - B(S)^D B /{1 + D B ) and 
hence 



8\ x — B{S) + gzyDs 



B(S) 
l + D B 



9- 



D B 



1+D B 



(10) 



Since 3_ is a zero-mean Gaussian with variance cr 2 , the 
PDF of Si x at cr is that of g- , with mean shifted by fii x = 
B(S)/(1 + D B ) and variance rescaled by D B /(1 + D B ), 
hence 



Tl(8 lx ,S) 



( S lx -I'll) 



2irSDf 



(11) 



where D c ^ — D B /(1 + D B ). The argument above is in- 
dependent of the specific choice of B(S), so it is valid 
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for any value of the barrier, not just the spherical col- 
lapse case for which B(S) — 8 C . In the case of correlated 
walks due to smoothing 8 with a sharp-x filter one may 
expect a correction to the above formula. However, since 
in this coordinate system there is no absorbing bound- 
ary associated with the random walk of g_, then, the 
non-Markovian correction vanishes as shown in [5J and 
Eq. (fTTj) remains valid also in this case. 

We test the validity of this formula by comparing the 
distribution of values of 8\ x at a given a measured from 
direct Monte Carlo simulations of the excursion set walks. 
In practice, we run walks with a barrier given by /3 — 
0.074 and Db = 0.43 (the values required for our first 
crossing distribution to fit the DEUS mass function), and 
we measure ili^i^, er) in ln(cr) with bins of width 0.1. The 
results are shown in Fig. [5] at S = 1.5, S = 2 and 3. As we 
can see Eq. (fTTj) describes the Monte Carlo measurements 
well. 

If the excursion set approach is self-consistent, then 
H(6i x ,a), with parameters calibrated from fitting f(a) 
to halo counts in simulations, should provide a good de- 
scription of the distribution of 8 measured in the N-body 
simulations. To estimate this in the DEUS simulations 
we select a random particle for each halo of mass M200 
and evaluate the overdensity 8\ x within a sphere of radius 
R = (3Af20o/47r/o) 1//3 around it in the initial conditions. 
Choosing a random halo particle (rather than the one 
at the protohalo center, say) is crucial, since this makes 
the measurement correspond to the quantity which the 
usual excursion set calculation returns [35j]: i.e., an aver- 
age over all positions in the initial field. In contrast, the 
particle which lies closest to the initial center of mass rep- 
resents a special subset of all walks [14} . We will consider 
such walks shortly. 

Binning in er(M2oo) yields Tl{5\ x , a) shown in Fig. [2] 
Our values of S — 1.5 S — 2 and 3 correspond to masses 
M/10 12 M Q = 22,7.8 and 2.2 (566, 1467 and 6185 ha- 
los, respectively). We do not have enough halos to test 
smaller values of S. Fig. [2] shows that n((5i a; , a) for these 
halos is in good agreement with the excursion set pre- 
dictions (analytical and numerical) for the barrier model 
parameters which best-fit the halo mass function sug- 
gesting an impressive self-consistency of the excursion 
set approach at low mass halos. Indeed, we measure a 
mean and a variance of distribution which are respec- 
tively (1.277 ± 0.033, 1.255 ± 0.023, 1.314 ± 0.015) and 
(0.481 ± 0.032,0.627 ± 0.026,0.868 ± 0.020) to be con- 
fronted to the mean and the variance of our theoretical 
prediction (1.248,1.275,1.326) and (0.448,0.601,0.894). 

As already stressed, choosing a random particle from 
each halo, rather than the one at the center of mass is cru- 
cial. To see that this matters greatly, the (more sharply 
peaked) red histograms in Fig.[2]show 8\ x associated with 
center-of-mass particles in the same halos. The values as- 
sociated with halo centers-of-mass tend to be larger and 
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FIG. 2: Distribution of first-crossing overdensities in Monte- 
Carlo simulations (blue histograms) and our prediction 
Eq. (smooth solid black curves), for parameters cali- 

brated using the first crossing distribution shown in Fig. \T\ 
and the initial overdensities around randomly chosen halo par- 
ticles (black histograms) at S = 1.5, 2 and 3. Red histograms, 
which are more sharply peaked, show the same measurement 
but around the halo centers of mass. In this case, smooth 
curves show the best-fitting Lognormal. 



are almost always greater than 8 C . This is consistent 
with the analysis in [Tij ]. which implies that 8\ x for the 
center-of-mass particle in a halo will be larger than for 
any other particle in the halo. In addition, the distri- 
bution for center-of-mass particles is well described by a 
lognormal in agreement with previous work 36]. Over 
the range of mass considered we find 



n(<5 cm ,s) 



<5 cm \/2n D cm S 



(12) 



with fi cm = ln[(<5 c + /3 cm 5 7 )/g] - D cm S/2 and 
{/3 cm = 0.47, 7 = 0.615, q = 1.04, D cm = 0.0167}. At S = 
1.5,2,3 this expression leads to a mean value (8 cm ) = 
(2.188,2.301,2.492) while the N-body analysis gives 
(2.186±0.355, 2.289±0.012, 2.486±0.009). This is in rea- 
sonable agreement with the crude estimate from the el- 
lipsoidal collapse model of 14} , suggesting that one might 
be able to build a physically consistent model for center 
of mass walks. However, the same crude averaging which 
led to the predicted mean relation suggests that the rms 
scatter around it should be ~ 0.18cr while the data anal- 
ysis leads to (0.126 ± 0.008, 0.175 ± 0.007, 0.323 ± 0.007) 
and Eq. (12]) gives (0.121,0.179,0.316) at S = 1.5,2,3. 
Some of the discrepancy may be due to the fact that our 
estimate of <5 C m is broadened as a consequence of having 
assumed the initial volume to be spherical, when it is 
usually not 13711. 
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Although the excursion set approach does not pre- 
dict the distribution around this mean for center-of-mass 
walks. This shortcoming would be resolved if we had an 
excursion set model for center-of-mass walks; this is the 
subject of work in progress. In this respect, one must 
view our results as indicating that the excursion set ap- 
proach may provide a good effective model of the statis- 
tics of collapse, though its relation to the physics still 
remain unclear. Recently, [22| have argued that, within 
the context of peaks theory, one can build a model for 
the center-of-mass walks, from which H(5i x ,a) can be 
predicted. Note that this approach also requires the scat- 
ter amplitude of the barrier to be calibrated with N-body 
simulations. One might think that for these type of walks 
the PDF of Six matches the one of the center-of-mass 
walks. Unfortunately this consistency check has not been 
proved by the authors. 

Alternatively, we can perform a similar study of the 
excursion set theory but for a log-normally distributed 
barrier and test whether the distribution of 5\ x recovers 
that of center-of-mass particles. 

Overall, it is remarkable that our simple Gaussian 
model for the barrier yields a self-consistent excursion set 
approach and predict an analytic halo mass function with 
unprecedented agreement with respect to N-body simu- 
lation at the cost of only two collapse parameters which 
are coherent with the N-body analysis for the range of 
mass we investigate. Whether it is merely an effective 
approach or provides the stochastic master equation de- 
scription of the gravitational N-body problem still re- 
mains an open question which we hope to address in the 
future. 
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